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$_i Abstract 

a 

This work is devoted to direct mass transportation proofs of families of functional inequalities in the context 
of one-dimensional free probability, avoiding random matrix approximation. The inequalities include the free 
form of the transportation, Log-Sobolev, HWI interpolation and Brunn- Minkowski inequalities for strictly convex 
potentials. Sharp constants and some extended versions are put forward. The paper also addresses two versions 
of free Poincare inequalities and their interpretation in terms of spectral properties of Jacobi operators. The last 
part establishes the corresponding inequalities for measures on K + with the reference example of the Marcenko- 
ry ^ Pastur distribution. 

1 Introduction 



i ** H i A distinguished role in the world of functional inequalities is played by the logarithmic Sobolev (Log-Sobolev) 
inequality and the Talagrand or transportation cost inequality. There is an extensive literature dedicated to 
these inequalities in the classical setting of Euclidean and Riemannian spaces (cf. e.g. [2], [23], [29], [32]). 

Given a probability measure v on R d , the transportation cost inequality states that for some p > and any 
other probability measure p, on R d , 

pWSfav) <E(n\u). (T(p)) 
Here Wa(/i, v ) IS the Wasserstein distance between p and v of finite second moment defined by 

m 

W»(n,v)= inf ( f[ \x-y\ 2 7r(dx,dy)] 
with v) denoting the set of probability measures on M. 2d with marginals fi and v and 

'X 



EW) = J lo § % d ^ 



is the relative entropy of \i with respect to v if /i << v and +oo otherwise. The Log-Sobolev inequality is that 
for any /i 

E{ix\v)<^- p I{ix\v) (LSI(p)) 



where 

/(,M=/|v.o g f 



(If I. 



is the Fisher information of p with respect to v which is defined in the case \i « v with being differentiable. 
A more subtle inequality is the HWI inequality relating entropy ( notice that E(p\v) is H(p\v) in [2 ] which 
explains the H), Wasserstein distance W, and Fisher information I 
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Poincare's inequality in this classical context is that for any compactly supported and smooth function ip on R d , 



pVarM) < / |V^| 2 M (^) (P(P)) 



where Var AI (-0) = J %jj 2 {x) p{dx) — (J ip(x)p(dx))^j 2 is the variance of ip with respect to p. 

Starting with Gaussian measures ([14], [28]), these inequalities were established for measures on R d with 
strictly convex potentials by the Bakry-Emery criterion ([2], [23], [29], [32]). More precisely, if v{dx) = e~ v ^dx, 
with V(x) — p\x\ 2 convex on R d for some p > 0, both T(p) and LSI(p) hold true. Otto and Villani generated 
interest in this topic through their remarkable paper [25], in which they showed that the logarithmic Sobolev 
inequality implies the trasportation inequality, in a rather general setting. This connection was actually put 
further through the stronger HWI(p) inequality, which was shown in [25] to be valid in the case V(x) — p\x\ 2 
is convex for some p € R, When p > 0, LSI(p) is a consequence of HWI(p). Subsequently the main result 
from [25] was simplified and extended, for example [5] and recently [13] to mention only two sources. Another 
interesting connection in these families of functional inequalities is that any of T(p), LSI(p) or HWI(p) imply 
the Poincare inequality P(p). 

The work [25] by Otto and Villani input in a powerful way the use of mass transportation ideas in the 
context of functional inequalities. Starting from this, Cordero-Erausquin used in [!)] direct convexity arguments 
combined with mass transport methods to reprove the Log-Sobolev, transportation and HWI inequalities for 
measures with strictly convex potentials. The strategy is going back to the original approach of [28] to the 
transportation inequality (see also [4]). 

In the world of free probability, as it was shown by Ben Arous and Guionnet in [1], one can realize the free 
entropy as the rate function of the large deviations for the distribution of eigenvalues of some n x n complex 
random matrix ensembles (see also [19]). To wit a little bit here, let V : R — * R be a nice function with enough 
growth at infinity and define the probability distribution 

¥ n {dM) = — e -" Tl "( y ( M »dM 

on the set Tt n of complex Hermitian n x n matrices where dM is the Lebesgue measure on 7i n . For a matrix M, 
let p n (M) = - ^2k—i S\ k (M) be the distribution of eigenvalues of M. These are random variables with values in 
V(M), the set of probability measures on R which converge almost surely to a non-random measure py on R. 
For a measure p on R, its the logarithmic energy with external field V is defined by 

e (p) = J V(x)p(dx) -JJ \og\x- y\ p(dx)p(dy). 

The minimizer of E(p) over all probability measures on R is exactly the measure py . From [1] we learned that 
the distributions of {p n }n>i under P n satisfy a large deviations principle with scaling n 2 and rate function given 
by 

R(p) = E(p) - E(p v ) 

The example of the quadratic potential V{x) — x 2 defining the paradigmatic Gaussian Unitary Ensemble in 
random matrix theory gives rise to the celebrated semicircular law as equilibrium measure. 

Within this random matrix framework, if V(x) — px 2 is smooth and convex for some p > 0, then the function 
<j)(M) = Tr n (V(M)) is strongly convex ($(M) - np\M\ 2 is convex) on R™ 2 = H n . An application of the 
classical LSI(np) on 7i n for large n was used by Biane [ ] to prove a Log-Sobolev inequality in the context of 
one-dimensional free probability which holds (cf. [ ]) in the following form 

E(p) - E(py) < ^ I(p) (1.1) 

4p 

for any probability measure p on R whose density with respect to the Lebegue measure is in L 3 (R), where 
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I(Ai)= J {Hp(x)-V / (x))p(dx) 

with Hp = 2 J -^z^p(dx) being the Hilbert transform of p. 

More precisely, Biane and Voiculescu used the free Ornstein Uhlenbeck process and the complex Burger 
equation. Using the large random matrix strategy, Hiai Petz and Ueda [18] reproved and extended the result of 
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Bianc and Voiculescu in the following form. If V(x) — px 2 is convex for some p > 0, then for every probability 
measure /i onl, 

P W 2 (p,p v )<E(p)-E(p v ). (1.2) 

Later, the first author [24] gave a simpler proof of (1.1) and (1.2) based on a free version of the geometric 
Brunn-Minkowski inequality obtained as a random matrix limiting case of its classical counterpart. He also 
showed the free analog of the Otto-Villani theorem indicating that the free Log-Sobolev inequality implies the 
free transportation inequality (1.2). 

The first scope of this paper is to provide direct proofs of the preceding functional inequalities in free 
probability without random matrix approximation. The second author of this paper in [26] gave a simple proof 
of the transportation inequality (1.2) on the same line of ideas as in [28] for the classical case where random 
matrix theory is entirely avoided. 

In this paper, following the approach of Cordero-Erausquin [9] (see also [4]), we use a combination of mass 
transport and convex analysis which apply to strictly convex potentials. The methods allow us besides to 
enlarge the class of potentials under consideration, in particular in instances which lack a proper random matrix 
approximation. For example, we cover potentials V on the line such that V(x) — p\x\ p is convex for some p > 
and p > 1 as well as a class of bounded perturbations of convex potentials. Using this approach, we present here 
an HWI free inequality for various cases of potentials. For the case V(x) — px 2 convex for some p £ R, this is 

E{p) - E{p v ) < ^J[pT)W 2 {p,p v ) - P W 2 {p,p v ). (1.3) 

Also a Brunn-Minkovski inequality receives a direct proof as well. 

One interesting byproduct of our method is that some constants may be shown to be sharp. For the case of 
a quadratic V, equations (1.1), (1.2) and (1.3) are sharp. 

Another topic discussed here in Section 3 is a free form of the transportation inequality which does not 
depend on the potential and that might be thought of as a version of the celebrated Pinsker inequality comparing 
total variation distance and entropy between probability measures. As opposed to the classical case, the free 
counterpart is more delicate. 

The second part of this work is devoted to free one-dimensional Poincare inequalities. Using random matrix 
approximations and the classical Poincare inequality, we first give an ansatz to what could be a possible Poincare 
inequality in the free probability world. In the case of V(x) — px 2 convex for some p > 0, such that the measure 
pv has support [—1,1], this states as, 

WM*,) > JL f t (m^M) 2 ^^y dxdy , (i.4) 

2tH y_ a J_ 1 \ x-y J Vl - x 2 ^fl - y 2 

for any smooth function cj> on the interval [—1, 1]. 

There is also a second version of the Poincare which is discussed in [3] for the case of the semicircular law. 
This inequality has a natural meaning in the context of free probability as the derivative of a function from 
the classical P(p) is replaced by the noncommutative derivative ^ x lZy^ , an d thus our second version takes the 
form 2 

( ^Zy^ ) Kdx)p(dy) > CVar^) for every <j6 £ C£(K). (1.5) 

As opposed to (1.4) which requires certain conditions on the measure pv , it turns out that (1.5) is always satisfied 
for any compactly supported measure p with some constant. As was shown in [ ] for the semicircular law, one 
can completely characterize the distribution in terms of the constant C. 

After the use of convexity, inequality (1.4) may actually be interpreted as a spectral gap as follows. On 

L 2 ( ll ^/ir^? ~) take t ne Jacobi operator 

L.f =-{l-x 2 )f"{x)+xf'{x) 
and the counting number operator defined by 

NT n = nT n 

where T n are the Chebyshev polynomials of the first kind, which arc orthogonal in L 2 ( 1[ ~^^s ~^j ■ Then, (1.4) 
for V(x) = x 2 /2 is equivalent to 

L > N. 
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Inequality (1.5) in the case of V(x) = x 2 /2 can also be seen as the spectral gap for the counting number 
operator on L 2 (l[-2.2]{ x )V4 — x 2 dx) with respect to the basis given by the Chebyshev polynomials of second 
kind. A more general situation is discussed in Section 9 which includes both versions of the Poincare inequalities. 

As we mentioned already, in the classical setting, the Log-Sobolev and the transportation inequality imply 
the Poincare inequalities. We do not have a satisfactory picture of these implications in the free context, for any 
of the two versions of the Poincare inequality discussed here. 

In the final part, we investigate the preceding families of functional inequalities for probability measures 
supported on the positive real axis. The random matrix context is the one of Wishart ensembles with reference 
measure the Marcenko-Pastur distribution as opposed to the semicircular law, and the free functional inequalities 
correspond formally to the case of potentials V(x) — rx — slog(x) for r > 0, s > on R + . Using the mass 
transportation method, we prove transportation, Log-Sobolev and HWI inequalities which were not investigated 
previously. A version of the Poincare inequality is also discussed. 

The structure of the paper is as follows. Sections 2, 4, 5 and 6 deal with the mass transportation proofs 
of respectively the transportation, Log-Sobolev, HWI and Brunn-Minkowski inequalities. Section 3 studies 
transportation inequalities which involve some metric on the probabilities and which are independent of the 
potential V. Sections 7 and 8 are devoted to the two versions of the Poincare inequality in the free context, 
related in Section 9 through Jacobi operators. Section 10 investigates the preceding inequalities with respect to 
the Marcenko-Pastur distribution and its convex extensions. 

2 Transportation Inequality 

Throughout this paper we consider lower semicontinuous potentials V : R — > R such that 

lim (V(x) -21og|x|) = oo. (2.1) 

\x\ — >oo 

For a given Borel set r C R, denote by V(T) the set of probability measures supported on L. 
The logarithmic energy with external potential V is defined by 

E v(fJ>) ■= J V(x)fi(dx) ~JJ \og\x- y\ (j,(dx)(i(dy). 

whenever both integrals exist and have finite values. In particular for measures (i which have atoms, Ey(/S) = +°° 
because the second integral is +oo. 

It is known (see [27] or [11]) that under condition (2.1) there exists a unique minimizcr of Ey in the set 
V(M) and the solution fiy is compactly supported. The variational characterization of the minimizer fiy (cf. 
[27, Theorem 1.3]) is that for a constant Cel, 

V(x) > 2 J log \x — y\ fiv(dy) + C for quasi-every x € K 

V(x) — 2 J log \x — y\ fiv(dy) + C for quasi-every x € supp(/iy), 



(2.2) 



where supp(/iy) stands for the support of /x. If /i is such that Ey(n) < oo, then Borel quasi-everywhere sets 
have n measure and thus the properties above hold almost surely with respect to fi. 

For simplicity of the notation, we will drop the subscript V from Ey unless the dependence of the potential 
has to be highlighted. 

Now we summarize some known facts about the equilibrium measure and its support as one can easily deduce 
them from [27, Chapter IV] and [11, Chapter 6]. 

Theorem 1. 1. Let V be a potential satisfying (2.1) and a ^ 0,(3 £ K. Set V a ^(x) — V(ax + (3). Then, 
jJLy a ff = ((id — P)/ct)#(J<v an d 

E v (n v ) = E Va0 (n Va0 ) -log|a|. (2.3) 

2. IfV is convex satisfying (2.1), then the support of the equilibrium measure \iy consists of one interval [a,b] 
where a and b solve the system 

£J b a V'(*)^dx = l 



r r b V'(x)J£=*dx = -l 

In J a V ' V x—a 



(2.4) 
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3. Let V be either a C 2 satisfying (2.1) whose equilibrium measure has support [a,b]. Then the equilibrium 
measure pv has density g(x), given by 

^(x-a)(b-x) [» V'(y)-V'(x) 

g{x) = l [aM {x) — / , , =dy. (2.5 

2lT Ja (y - x)y/(y - a)(b - y) 

4. IfV is C 2 , then 

V(x)=p.v. / pv(dx) for py — a.s. allx £ supp(pv), (2-6) 

J x ~ y 

where p.v. stands for the principal value integral. Notice that the principal value makes sense as py has a 
continuous density. 

We mention as a basic example that if V(x) = px 2 is quadratic, then py is the semicircular law 



pv(dx) = 1 ,-r- r-- (x) yj2p - p 2 X 2 



dx 

[-y/2/p^2/ P ]^' v "f y ~ ^ ■ 

In this work, for p > 1, we use W p (p, v) for the Wasserstein distance on the space of probability measures on 
defined as 

l/p 



W p (p,v) = inf / / \x-y\ p n(dx,dy)\ (2.7) 
7ren(/i,i/) \J J J 

with II(/i, v) denoting the set of probability measures on W 2 with marginals p and v. Note here that if 9 is the 
(non-decreasing) transport map such that 6#p = is, then 

WP(p,v)= J \0{x) -x\ v v(dx). (2.8) 

For a detailed discussion on this topic we refer the reader to [29] . 

Our first result concerns the free version of the transportation cost inequality. As discussed in the introduction, 
the first assertion for strictly convex potentials was initially proved by large matrix approximation in [18]. The 
strategy of proof is inspired from [28], [4] and [9] (see [26]). 

Theorem 2 (Transportation inequality). 1. If V is C 2 and V(x) — px 2 is convex for some p > 0, then for 
any probability measure p on K, 

pW 2 (p,p v )<E(p)-E(p v ). (2.9) 

IfV(x) — px 2 , then the equality in (2.9) is attained for measures p = 9#pv, with 6{x) = x + m, therefore 
the constant p in front of W^ip, pv) is sharp. 

2. Assume that V is C 2 , convex and V"(x) > p > for all \x\ > r. Then, there is a constant C — 
C(r, p, pv, V) > 0, such that 

CW 2 (p,p v )<E(p)-E(p v ). (2.10) 

3. In the case V is C 2 and V{x) — p\x\ p is convex for some real number p > 1, then, for any probability 
measure p on R, 

c pP WP(p,p v ) < E(p) - E(p v ) (2.11) 
where c p = inf^gn (|1 + x\ p — \x\ p — psign(a;)|a;| p_1 ) > 0. 

Proof. 1. Since there is nothing to prove in the case E{p) = oo, we assume that E(p) < oo. In this case we 
also have that the measure p and pv both have second finite moments. 

Now we take the non-decreasing transportation map 9 such that 9#pv = p which exists due to the lack of 
atoms of pv- Using the transport map 9, we first write 

E(p) - E(p v ) = J (V(9(x)) - V(x) - V'(x)(9(x) - x))p v {dx) (2.12) 

8(x)-0(y) , 9(x)-9(y)\ , , , 

— — - 1 - log — — p v (dx)p v (dy) 

x-y x-y J 

where in between we used the variational equation (2.6) to justify that 

J V'(x) (9(x) - x)p v (dx) = 2 JJ 6 M^. p v (dy)p v (dx) = J J { ° {x) - ^ ~^ %) - v) p v (dy) p v (dx) . 
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Since V{x) — px 2 is convex, for any x, y the following holds 

V(y) - V(x) - V'(x)(y - x) > p(y 2 - x 2 - 2x{y - x)) = p(y - xf . 

On the other hand since a — 1 > log(a) for any a > 0, equations (2.12) and (2.8) yield (2.9). 

In the case V(x) = px 2 it is easy to see that for 9(x) = x + m, all inequalities involved become equalities. 

thus we attain equality in (2.9) for translations of \iy . 

2. We start the proof with (2.12), whereas this time we need to exploit the logarithmic term to get our 
inequality. The idea is to use the strong convexity where ^p(x) :— 9{x) — x takes large values and for small 
values of ip(x) we try to compensate this with the second integral of (2.12). 
Notice in the first place that by Taylor's theorem we have that 



V(y) -V(x) -V'(x)(y-x) = {y - x) 2 I V" ({I - t)x + ry)(l - r)dr. (2.13) 

Jo 

Now, let us assume that the support of the equilibrium measure py is [a, b). Next, V"(x) > and V"(x) > p 
for \x\ > r, implies that for \y\ > 2r + 2max{|a|, \b\}, we obtain that 

V(y)-V(x) -V'(x){y-x) > {y - x) 2 f V"((l - t)x + ry){l - r)dr > p(y - x) 2 /8 for any x G [a, b] . 

Jl/2 

Now write 9(x) — x + i/j(x). Thus using (2.12), and denoting R = 2r + 2max{|a|, |&|} we continue with 
(V(9(x))-V(x)-V , (x)(9(x)-x))p v (dx)>^ J t/j 2 (x) V" (x + Tip(x)){l - T)drp v {dx) 

> [ i> 2 (x)p v (dx). (2.14) 
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This inequality provides a lower bound of the first term in (2.12). Further, it is not hard to check that 



Tp 2 {x)p v {dx) = ^ j l w > R (x)ip 2 (x)p v (dx) + i / l M > R (y)ifj 2 (y)p v (dy) 



W>R 

1 



~ 1 \i>(x)-4>(y)\>2R( x >y)\^( x ) - *P(y)\ 2 Vv(dx)p v (dy). (2.15) 

Now we treat the second integral on the left hand side of (2.12). Use that t — log(l + 1) > \t\ — log(l + |i|) 
for any t > — 1 together with the fact that t — log(l + t) is an increasing function for t > to argue that 

i)){x)-^{y) ( ip(x) - ip(y)\\ , , s 

- log 1 H p v {dx)pv{dy) 



x — y \ x — y 

> 



Further, for s > and u, v > we have 



us 2 + s - log( I + s) > l v 2 ~ ~ >mnJu, ^ U s 2 



This inequality used for u — p ^' 2 °- ) and v — in combination with (2.15) and (2.16) yields for the 
choice of c = min{u, (v — log(l + v))/v 2 } that 

i> (x)pv(dx) + J J I — logllH — I I pv(dx)p v (dy) 

(2.17) 

>c I I (tp(x) - ip{y)Y p v {dx)p v {dy) = c / tp 2 (x)p v (dx) - ( / ip(x)p v (dx) 
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This shows that E(p) — E(pv) is bounded below by a constant times the variance of i/j. Notice that 
W2(p,pv) = J i{} 2 (x)pv(dx) and in order to complete the proof we have to replace the variance of tj) by 
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the integral of ip 2 with respect to /xy . This boils down to estimating the pv integral of ip in terms of the 
integral of ip 2 . 

To this end, use Cauchy's inequality: 

i>(x)n v (dx)\ < J i(j 2 (x) ^1 + i- J V"(x + Tip(x))(l - r)drj p v {dx) 

-pv(dx). 



1 + h So v "( x + T ^( x ))( l - T ) rfr 

This inequality combined with equations (2.12), (2.14) and (2.17), results with 

E{p) - E{p v ) > J^ 2 ( x )( c+ \J o V"{x + T^{x)){l-T)d^j p v {dx) 

f^V"(x + Ttp(x))(l-T)dl 



2c + Jg 1 V"(x + np(x))(l - T)dr, 



fj,v(dx) 



>c / — i u v (dx)W 2 (p,pv), 

1 2c+fiV"(x + -nl>{x))(l-T)dT 



where here we used the convexity encoded into V" > and the fact that W|(/i, nv) = J ip 2 (x) fiy (dx) to 
get the lower bound of the first integral. 

From the previous inequality, it becomes clear that we are done as soon as we prove that the quantity in 
front of W 2 ([i, Hv) is bounded from below by a positive constant uniformly in ip. To carry this out, notice 
that V" can not be identically zero on [a,b]. Indeed, if V" were identically zero on [a, b], then we would 
have that V'(x) — K for all x £ [a, b], and this plugged into equation (2.4), yields that K (b — a) = 2 and 
K(b — a) = —2, a system without a solution. Therefore V" is not identically on [a, b}. If |^(x)| > R, then 
V"(x + rip(x)) > p for 1/2 < r < 1, which implies J* V"(x + Tip(x))(l - r)dr > p/8. On the other hand, 
if \ip(x)\ < R, then 

V"(x + T^(x)){\~T)dT> I V"(x + Tip(x))(l -T)dr > - uif V" {y) 

Jo 2 \y-x\<SR 



for all < 5 < 1. Define 



w(x)= sup minj^,^ inf V"(y)\ 

Se[0A] I o Z \y-x\<SR ) 



Since V" is not identically on [a, b], it follows that w is not identically zero on [a, b]. With this we obtain 
that 



and then that 



V"(x + Ttp(x))(l -r)dr > w{x) > 0, 



^V"{x + r^{x)){l-T)dr f cw{x) 

pvydx) > C = I — — pv\dx) > 



2c+/ 1 V"(a; + TV'(a;))(l-r)dr J2c + w(x) 

which finishes the proof of (2.10) with this choice of C. 

3. For the inequality (2.11), we follow the same route as in the proof of (2.9), the only change this time being 
that V(x) — p\x\ p is convex, and thus we obtain 

V(y) - V(x) - V'(x)(y - x) > P (\y\ p - \x\* - psign^)^^" 1 ^ - x)). (2.18) 

Writing ^(ar) = x + ip(x), and using (2.12) together with a — 1 > log(a) for a > 0, one arrives at 

E{p)-E{p v )>p J (\x + i){x)y ~\x\p -psign{x)\x\v- l ^{x))ii v {dx). 

Now we use the fact that for all a, b £ R, 

\a + bf - \b\ p -psign(fe)|6| p - 1 a > c p \a\ p , (2.19) 
which applied to the above inequality in conjunction to (2.8), yields inequality (2.11). □ 
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Remark 1. 1. The C 2 regularity of V for (2.9) can be dropped (see [26]) but to simplify the presentation 
here we decided to consider only this case. 

2. IfV(x) — p\x\ p is convex, then using inequalities (2.11), (2.10) and Young's inequality we obtain that for 
any 2 < k < p, there exists a constant c = c{k,p, p, py ,V) such that 

eWl:(fjL,^ v )<E(fjL)-E{fjL V ). 

3. We want to point out that the inequalities (2.11) and (2.10) are somehow complementary to each other. 
For example, if we take V{x) = p\x\ p with p > 1 and the measure p — 0#pv for 9{x) = x + m, then 
equation (2.11) takes the form 

c p m p < J (\x + m\ p - \x\ p )p v (dx) (2.20) 

while equation (2.10) becomes 

Cm 2 < J (\x + m\ p -\x\ p )(i v (dx), 
which, because it is easy to check that py is symmetric, is the same as 

Cm 2 < [ (|a; + m| p - \x\ p - psig^x^x^jn) p v {dx). (2.21) 



Notice here that (2.20) is in the right scale for large m as (2.21) is in the right scale for m close to 0, 
because in this case the integrand is of the size m 2 . It seems that Talagrand's transportation inequality in 
this context has two aspects, one is the large W p (fi, fXv) which is dictated by the potential V for large values 
and results with equation (2.11) and the small Wi{p-, Pv) regime which is dictated by the repulsion effect 
of the logarithm and results with equation (2.10). 

4-. It is not clear whether inequality (2.10) still holds for the case of a potential V which is not convex. Of 
interest would be the particular case V{x) — ax* + bx 2 for some a > and b < 0. This example actually 
raises the question of the stability of transportation inequality under bounded perturbations. 

5. Very likely the constant c p in (2.11) is not sharp. 



3 Potential Independent Transportation Inequalities 

In this section, we investigate some potential independent transportation inequalities. A transportation inequal- 
ity in the form of (2.10) can not possibly hold without a quadratic growth at infinity. Also, the proof of (2.10) 
might lead to the conclusion that the logarithmic term plays a more important role. Therefore the natural ques- 
tion one may ask is whether there is a manifestation of this fact in some sort of transportation type inequality 
which is independent of the potential involved. The main question reduces to hint some appropriate distance one 
needs to use to replace the Wasserstein distance in Theorem 2. We investigate in this section several possibilities, 
starting with the free version of the classical Pinsker's inequality. 
The Pinsker's inequality classically states that (cf. [10] and [21]) 

2\\p — i/\\ 2 < E(p\v) for any p, v probability measures on R, 

where \p — v\ v is the total variation distance between p and v and E(p\v) is the relative entropy between p and 
v. This in particular shows that if fi n convergence to /i in entropy, then p, n converges to /i is a very strong sense. 

The same natural question can be posed in the logarithmic entropy context. For a given potential V, is there 
an inequality of the form 

C\\p,-^ V \\ 2 V <E(^-E(fx v ) 

for a given constant C > and any probability distribution /i on R? 

It turns out that these inequalities do not hold for the logarithmic energy. In fact, we will show that even a 
weaker inequality of the form 

C\F^-F^ v \ 2 u <E(v)-E(tiv) (3.1) 

does not hold, where F^ denotes the cumulative function of a probability measure \i on the line. Even though 
the uniform distance does not have the same widespread use in probability it appears for example in the Berry- 
Esseen type estimates for the convergence in the central limit theorem. This is the reason why we consider this 
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distance as the first next best candidate wherever the total variation fails. Clearly this metric gives a stronger 
topology as the topology of weak convergence. 

Will construct a counterexample to (3.1) in the case of V(x) — 2x 2 , for which the equilibrium measure is 



2y/l — x 2 
Hv{dx) = dx, 



the semicircular law on [—1,1]. Consider now the sequence 



LL n (dx) = lr_i i](x) dx + '- lK - A — i \ / ^ 

where Tk is the fc t?i Chebyshev polynomial of the first kind. With these choices we have that 

- E^v) < ^ - for all n > 4. (3.2) 

Let us point out that is indeed a probability measure. This requires a little proof but it's entirely 
elementary and is left to the reader. 

To prove (3.1), notice that since the support of ji n is the same as the support of /iy, we have from (2.2) that 



E(nn) - E(nv) = ~JJ log l x - vlif^ ~ Hv){dx){pL n - nv){dy). (3.3) 
Next remark that fi n — cos#(/„A) and \iy = cos#(<?A), where A is the Lebesgue measure on [0, it] and 



„ , , 1 — cos(2i) 1 2 v-^ 1 , « . . , 1 — cos(2i) 

fn(t) = + ~ E (-l) fe COs((2fc + l)t), ff(t) - 

1 ' fe=2 

and further 



log |x - y|(/in - ^v){dx)(ii n - nv){dy) = - / / log | cost - cos s\h n (t)h n (s)dtds where h n = f n -g. 

Jo Jo 

Now we provide a formula for the logarithmic energy we learnt from [15] and have not seen it elsewhere. 
Here is a quick description. Write first cost = (e lt + e~ lt )/2 and coss = (e zs + e~ ls )/2 so |cos£ — coss| = 
\{e u + e~ lt )/2- {e ls + e~ is )\/2 = |1 - e l (*+ s )||l - e l{ - t ' sS >\/2 and so, for t ^ s, and t or s not equal to it, 

OO 

log | cost - cos s| = - log 2 + Re(log(l - e 4 < t+s )) + log(l - e l( *- s) )) = - log 2 - Rc(e u{t+s) /£ + e ll(t - s) /£) 

oo 2 

= - log 2 - ^2 - cos(£t) cos(£s). 

i=\ 

From this, one gets to 

PTT P"K 00 ^ / fit \ 2 

— / / log | cos t — cos s\h n (t)h n (s) dtds = - I / cos(£t)h n (t)dt ) . 

Jo Jo £=1 ' Vjq / 

But now, 

1 ^ t r ,„ , { ~JiL n 4<£<4nandodd 



(3.4) 



otherwise 



/ cos(£t)h n (t)dt = 2 ^ (-l) fc / cos(A) cos((2fc + l)t)dt = ^ sf™ 2 - 1 ) 
and thus 

Jo /o log l cos * _coss ^"^™( s ) dtrfs = X] J (j Q cos(£t)h n (t)dtj = 1)2 53 2^+1 ' ^'^ 
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On the other hand \F^ - F^, \ u = \F fn \ - F gX \ u = sup xg[0i7r ] | J* h n (t)dt\ and 



h n (t)dt = 



1 



2n-l 



47r(n 2 - 1) 



1=1 



(-l) f sin((2l+l)x) 
2^+1 



from which for x = 7r/4, we obtain 



Combining (3.5) and (3.6) we get 

^2 



sup 

ze[0,7r] 



h n (t)dt 



2n-l 



> 



1 ^ 1 



2 2^=2 2l+l 



> 



log |sc - y\(fj, n - iJ,v)(dx)(n n - nv){dy) 



(3.6) 



(3.7) 



which together with the fact that Y^z=2~ 1 2I+1 — I l°g( n /3) for n > 4 and (3.3), we finally arrive at (3.2). 

The example shown above has the property that E(fi n ) — E(/j,v) converges to when n goes to infinity, 
and also that {F^ — Ffj, v \ u converges to zero. Despite the fact that (3.1) does not hold, we will see below in 
Corollary ?? that if E(fi n ) — E(nv) converges to 0, then — F^ v \ u always converges to 0. 

We consider now a weak form of (3.1). To do this we define the distance 



d(n, v) = sup 



-|a:r+&| 



fj,(dx) 



- \ax+b\ 



v(dx) 



(3.8) 



With this definition we have the following result. 
Theorem 3. For any potential V satisfying (2.1), we have that for any compactly supported measure \i, 

4^ 3 d 2 (/i, nv) < E(jj) - E{ii v ). (3.9) 

Proof. Using equations (2.1) and (2.2), we get for any compactly supported measure [i with E(/j.) finite, 

E{n) - E{nv) > - J J log I a; - y\(fi - fj, v )(dx)(n - fi v )(dy). 

We will prove that for any measures \i and v with compact support such that — J J log \x — y\fj,(dx) (i(dy) < 00 
and — //log \x — y\v{dx)v{dy) < 00, we have that 



47r J cP( M , v)<- / / log \x - y\(n - u)(dx)(n - u)(dy), 



(3.10) 



which shows that (3.10) implies (3.9). 

Now we use [11, equation 6.45] to write 

log \x - y\(fi ~ fi v )(dx)(n - Vv){dy) = 
where the hat stands for the Fourier transform, and continue with 



\fi{t) - fi v {t)f 



dt 



\m~m\ 2 dt= 1 
t 2 



\m-Ht)\ : 



dt > \a\ 



a 2 + t 2 " 77 



(£(*) ~ *(*)) e~ 
a 2 + t 2 



for any a,c £ R with a =/= 0. Further, using the inversion formula for the Fourier transform, one has 



(fi(t) - v{t)) e- 
a 2 + t 2 



dt = 2tt / cj)(x)(fi — v)(dx) = 



2tt 2 



i-\ a ( x + c )\(^- v )(dx) 



(3.11) 



dt 



(3.12) 



because for <j>(£) 



a 2 + t 2 ' 



(x) = 



e i(x+c)t 7re -\a(x+c)\ 
75 o dt = f" ; 

t 2 + a 2 a 



From here, (3.10) follows immediately. 



□ 



10 



Remark 2. From equation (3.11) it seems that the distance one should consider should be the Sobolev norm with 
exponent —1/2. This is another possible candidate to the role of d played here, however not always finite. We 
chose the metric d as it's definition is somehow close to uniform norm of the difference of the Laplace transforms 
of the measures. It is also always defined and bounded by 1, thus resembling the total variation distance. 

The next result is collecting facts about how strong the topology induced by d is. 

Proposition 1. 1. d is a distance on V(R) and if d([i n , fi) > 0, then [i n > fi in the weak topology. In 

n — >oa n — >oo 

addition d(d a , St) — 1 for a ^ b, thus the topology induced by d is strictly stronger than the weak convergence 
topology. 

2. For any two probability measures /i and v, 

d(n,u)<2\F fl -F v \ u . (3.13) 

3. If V satisfies condition (2.1), then Ey(fi n ) > Ev(fJ-v) implies \Fu„ — Fu v \u y 0- 

Proof. 1. To prove that d is a distance the only non trivial fact is that for two probability measures fi and v, 
d(fx, v) = implies /i = v. Thus from equation (3.12), we obtain for a = 1 that for all cel. 

(A(«)-^))e-" 
1 + t 2 

Since this holds true for any c G M, it implies that the Fourier transform of the function t — > ^fejS^ is 0, 
which means that the function in discussion must be 0. This means that ft, = v, or equivalently that [i — v. 
Let £(n, v) stand for the Levy distance which induces the weak topology on V(M). Let d(/i n ,//) > 0. 

n — >oo 

Assume now that there exists e > and a subsequence such that C(ii nk ,fi) > e. Otherwise said, the 
sequence /i„ has a subsequence which is not convergent to /i. Since, we are dealing with probability 
measures, there is a subsequence fx nk which is vaguely convergent to a measure v with total mass less than 
1. This means that for any continuous function <fi which is vanishing at infinity, we have that 



4>dn nki - ^> / 4>dv. 
We can apply this for functions 4>{x) = e -\ ax + b \ where a^0 and infer that 

e -\ ax+b ^ nk (dx) ► / e-\ ax+h \ V (dx) for all a ± 0, b G E. 

On the other hand, because d(fJ, nk , (i) > 0, these considerations result with 

Further, using the dominated convergence for b — and a — > 0, we obtain that v is a probability measure. 
From the discussion at the beginning of this proof, it also follows that v — fj, and this in turn results 
with n nk being weakly convergent to /i, a contradiction. This proves that the convergence in the metric d 
implies weak convergence. 

It is obvious that d((i, v) < 1 for any measures \x and v. For the case of discrete measures, we also have 

that 1 > d(6 a ,6 b ) > J e- a \ x - a \5 a {dx) - / e - a ^ a \S b (dx) for any a > 0, which yields that 1 > d(S a ,6 b ) > 
l_ e - a \b-a\ for all a > o Letting 

a — > oo, we get that d(S a , S b ) = 1 for a ^ b which shows that convergence 
in d is strictly stronger than convergence in the weak topology. 

2. From the fact that for any finite positive measure /i, 



(1 - e-^)n{dx) = / ae~ Q M(y, oo))dy, 

(0,oo) J(0,oc) 



we deduce that 



i - a \*- a \(n-u)(dx)= / ae- a »[F M (a-y)-F li (a + y)-F v (a-y)+F v (a + y)]dy 

J(0 : oo) 



which easily yields (3.13). 
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3. We actually show that if p n and p are compactly supported probability measures such that 

log \x - y\ p(dx)p(dy) < oo, - / / log \x - y\ p n (dx)p n (dy) < oo 



and 

lim / / log \x - y\(n„ - p)(dx)(p, n - p){dy) = 0, 



n — >oo 



then \Fn — FA U ► 0. From (3.10) and the first part, we obtain that p n converges weakly to p. In 

addition, none of the measures p n or p have atoms. Thus and are continuous functions which 
combined with the weak convergence implies that F^ n converges pointwise to F p . Since the functions F^ n 
and Fp are distributions of probability measures, it is an easy matter to check that the convergence is 
actually uniform. □ 

Remark 3. We do not know if the topology of convergence in d is the same as the one defined by the metric 

| F[X Fjy I U . 

This result might leave one wondering if a stronger convergence takes place. In other words, is it true that 
Ev(pn) * E v (p v ) implies \\p n — Pv\\v ► 0? To this end, we can consider V(x) — log | H+^ x2 ~ l| anc i 

n — >oo n — *oo 

notice (see [27, page 46]) that \xy is the arcsine law of [—1,1]. Thus if we consider 

dx . ^ , .{1-T n (x))dx 



^v(dx) = ![_!,!] (a;) — , ^ n {dx) = lr_i j i](a;) 

7TVl — X z 



iry/l — x 2 



then, using the same argument which led us to (3.4), with h n there replaced by h n (x) — cos(nx) here, one arrives 
at E(/j, n ) — E(/j,y) = i while the total variation distance is — ^tylU > 1/4. 

4 Log-Sobolev Inequality 

In this section, we develop similarly the mass transportation method to prove the Log-Sobolev inequality in the 
free context. Note again that, as discussed in the introduction, the first assertion for strictly convex potentials 
was initially proved by large matrix approximation in [3]. 

Before we state the main result, we define inspired by Voiculescu [ ], the relative free Fisher information as 

I{n) = J (Hfi(x) - V'(x)) 2 fi(dx) with Hfi(x) = p.v. J - 2 — fi(dy). (4.1) 

for measures /ionR which have density p — dfi/dx in L 3 (]R). In this case the principal value integral is a function 
in L 3 . Otherwise we let I(fj,) be equal to +oo. 

Theorem 4 (Log-Sobolev). 1. If V is C 2 and V(x) — px 2 is convex for some p > 0, then for any probability 
measure p, on R. 

E(p)-E(p v )<^I(p). (4.2) 
Ap 

Equality is attained for the case V(x) = px 2 and p — 0#pv- where 9{x) = x + m. Thus the inequality (4.2) 
is sharp for translations of py ■ 

2. If V is C 2 and V(x) — p\x\ p is convex for some p > and p > 1, then for any probability measure p on K, 
E(p)-E(p v ) < ^I q (p) where I q (p) = J \Hp(x) - V'{x)\ q p{dx) (4.3) 

where here q is the conjugate of p i.e. 1/q+l/p = 1 and the constant k p — (pc p ) q ^ p jq, with c p from (2.11). 

Proof. 1. We will assume that the measure p has a smooth compactly supported density as the general case 
follows via approximation arguments discussed in details in [ ]. Take the (increasing) transport map 8 
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from nv into p. We write the inequality (4.2) in the following equivalent way 

^ J (Hfi(9{x))-v'(e(x))) 2 ^ v (dx) + J (v{x)-v(e(x))-v'(e(x))(x-e(x)))n v {dx) 

-J - V'{6{x))){x - 0(x))n v (dx) 

+ J Hfi(0(x))(x - 9{x))p v {dx) - Jj 'log * ~ y p v (dx)p jV (dy) > 0. (4.4) 

Notice now that from the convexity of V(x) — px 2 , one obtains that 

V(x) - V(9(x)) - V'(0(x))(x - 9(x)) > p(x 2 - 9(x) 2 - 29(x)(x - 0(x))) = p(x - 9{x)) 2 . (4.5) 

Now, 

Hfj,(0(x))(x-0(x))n v (dx) = J (x-0(x)) J 2 _ p, v (dy)p v (dx) = J J K ^ _ V - l^j fi v (dx)p, v (dy) 

(4.6) 

where one has to interpret the second integral here in the principal value sense, however since 9 is increasing, 
the last integral is actually taken in the Lcbesgue sense. 
Using these, equation (4.4) may be rewritten as 



1 



[{Hp(9(x)) - V'{0{x)) - 2p(x - 9(x))] 2 p v (dx) 



' " - 1 - log * V a( z ) pv(dx)p v (dy) > 



9{x) - 9{y) b 9{x) - 9{y) 



which is seen to hold since u — 1 — log(u) > for u > 0. 

Equality is attained for the case V(x) — px 2 and 9(x) — x + c, which corresponds to the translations of 
the measure py. 

2. With the same arguments used in the above proof and the proof of Theorem 2, we use equations (2.18) 
and (2.19) to argue that 



kp 
ffl/P 



> 



Hp{x) - V'{x)\ q p{dx) - E(p) + E{p v ) 

r k 



p q/p 



Hp(9{x)) - V'(9(x)\ q + (V'(9(x)) - Hp(9(x))) (x - 9{x)) + c p p\x - 9(x)\ p p v {dx) 



1 ~ lo S 7^7^ ^TT ) Vv{dx)p v (dy) 



0{x) - 9{y) b 9{x) - 9{y) 

> 

where we used Young's inequality a q /q + b p /p > ab for a, b > and the constant k p = (pc p ) q ^ p /q. □ 
Remark 4. It was proved in [24] that a Log-Sobolev inequality always implies a transportation inequality. 



5 HWI Inequality 

This section is devoted to the free analog of the HWI inequality of Otto and Villani [25] in the classical context, 
connecting thus the (free) entropy, Wasserstein distance and Fisher information. As we will see, the HWI implies 
the Log-Sobolev inequality for strictly convex potentials. This free HWI inequality was not considered before, 
and in particular it is not clear whether there is a random matrix proof, delicate points involving the Wasserstein 
distance entering into the proof. 

Theorem 5 (HWI inequality). 1. Assume that V is C 2 such that for some p G R, V(x) — px 2 is convex. 
Then, for any measure p G V(M), 

E{p)-E{p v ) < ^I(pjW 2 (p,p v )-pW 2 (p,pv). (5.1) 
In the case V(x) = px 2 , the inequality is sharp. 
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2. If V is C 2 and V(x) — p\x\ p is convex for some p > and p > 1, then for the same constant c p appearing 
in Theorem 2, we have that 

E(p)-E(p v ) < JV9( M ) Wrfanv) -pCrWffanv), (5.2) 

where 1/p + X/q = 1 . 

Proof. 1. We employ here the notations used in Theorem 4 and we will give a proof of the inequality for the 
case of a measure p with smooth and compactly supported density, the general case follows through careful 
approximations pointed in [18]. The inequality to be proved can be restated as (5.3) + (5.4) + (5.5) > 0, 
where 

1/2 

(5.3)= ( / (Hfi(9(x))-V'(e(x))Yfi v (dx) I (6(x)-xYfi v (dx) 



(H(i(8(x)) - V'(e(x)))(x - 9(x))fj, v {dx) (5.3) 
(5.4) = / [V(x) - V(9(x)) - V'(6(x))(x - 6{x)) - p(9(x) - xf]p v (dx) (5.4) 



(5.5) = J Hp(9(x))(x - 6{x))p v {dx) - J J log * _ y p v {dx)p v {dy). (5.5) 

A simple application of Cauchy's inequality shows that (5.3) > 0. Using convexity of V(x) — px 2 we have 
from equation (4.5), that (5.4) > 0. Finally, using (4.6), we have that 

(5 - 5) - // (<ra> - 1 - bs J™) ""«*^ ( « £ »• 

which finishes the proof of (5.1). For the case V(x) — px 2 , we have equality if 6(x) — x + m. 
2. The inequality we want to prove is equivalent to the statement that (5.6) + (5.7) + (5.8) > 0, where 



(5.6) 



\Hp{0(x))-V'(6(x))\ q p v (dx) 



i/</ 



(6(x)-x) p p v (dx) 



i/p 



(Hp(6(x)) - V'(6(x))) (x - 9{x))p v {dx) (5.6) 
(5.7) = / [V(x) - V(9{x)) - V'(8(x))(x - 9{x)) - pc p \9{x) - x\ p ]p v (dx) (5.7) 



(5.8) = j Hp{9{x)){x - 9{x))p v {dx) - J J log * _ l {y) Pv(dx) p v {dy). (5.8) 

Now, (5.6) is non-negative thanks to Holder's inequality, equation (5.7), follows from the convexity of 
V(x) — p\x\ p and the combination of (2.18) and (2.19), while equation (5.8) is the same as (5.5). □ 

As pointed out in [ ], HWI inequalities for p > always implies Log-Sobolev. We give here the following 
formal corollary of HWI inequality. 

Corollary 1. 1. If p > 0, then inequality (5.1) implies (4.2) and (5.2) implies (4.1). 

2. If V{x) — px 2 is a convex for some p £ R, then Talagrand's free transportation inequality with constant 

, (C+p) : 

'■> 32C 



C > max{0,— p} implies free Log-Sobolev inequality with constant K — max{p, ^ C £q }■ More precisely, 



VpeV(R), CW 2 (p,p v )<E(p)-E(p v )^VpeV(R), E{p) - E{p v ) < ~ I{p), 



3. In particular, if V is convex and C 2 such that V"{x) > p > for \x\ > r, then free Log-Sobolev inequality 
holds with the constant C > from (2.10). 

Proof. 1. It follows as an application of Young's inequality a p /p + b q jq > ab for a, b > 0. 
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2. For p > 0, everything is clear. In the case p < 0, then, from (5.1) and Talagrand's transportation inequality, 
one has for 8 > 0, that 

E(p) - E(p v ) < y/l(fi W 2 (fi, pv) -pWifji, pv) < idl(p) (E(p) - E(p v )) 

which yields for any 8 > 

AC8 2 

Taking minimum over 8 > gives the conclusion. 

3. In the case V is convex, C 2 and strongly convex for large values, part 2 of Theorem 2 does the rest. □ 



6 Brunn-Minkowski Inequality 

The (one-dimensional) free Brunn-Minkowski inequality was put forward in [24] again through random matrix 
approximation. We provide here a direct mass transportation proof similar to the one of its classical (one- 
dimensional) counterpart (see e.g. [12]). As discussed in [24], this inequality may be used to deduce in an easy 
way both the Log-Sobolev and transportation inequalities. 
The main result of this section is the following theorem. 

Theorem 6. Assume that V\, V2, V3 are some potentials satisfying (2.1) such that for some a £ (0, 1), 

aVi(x) + (1 - a)V 2 (y) > V 3 (ax + (1 - a)y) for all x,yeR. (6.1) 

Then 

aE Vl (p Vl ) + (1 - a)E V2 {p V2 ) > E Va (p Va ). (6.2) 

Proof. Take the (increasing) transportation map 6 from py t into pv 2 ■ This certainly exists as the measure pv r 
has no atoms. 

Noticing that for any measure with finite logarithmic energy, we have the obvious equality 



log \x - y\p(dx)p(dy) = 2 / log(x - y) p{dx) p{dy) . 

J x>y 

Using this we argue that 

f aVi{x) + (1 - a)V 2 {9{x))p Vl {dx) - 2 ff (olog(x - y) + (1 - a)log(0(x) - 6{y)))p Vl {dx)p Vl {dy) 

J J Jx>y 

> / V 3 (ax+(l-a)e(x))fi Vl (dx) -2 // log [(ax + (1 - a)8(x)) - (ay + (1 - a)0(y)}p Vl {dx)p Vl (dy) 

J JJx>y 

= E Vs (v) > E V3 (p V3 ) 

where v = (aid + (1 — a)9)#py 1 and we used (6.1) and the concavity of the logarithm on (0,oo). The proof is 
complete. □ 

7 Random Matrices and a First Version of Poincare Inequality 

In the next three sections, we investigate Poincare type inequalities in the free (one-dimensional) context. We 
discuss two versions of it. The first one is suggested by large matrix approximations and the classical Poincare 
inequality for strictly convex potentials, but will be proved directly. Recall first the classical Poincare inequality 
(cf. e.g. [2], [23], [29], [32]...). 

Theorem 7. Let p(dx) = e~ w ^dx be a probability measure on R d such that W(x) — r\x\ 2 is convex. Then for 
any compactly supported and smooth function </> : R d — > R. we have that 

|V0| 2 o7i >rVar M (0). (7.1) 
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Assume now that V is a potential on R with enough growth at infinity. Consider the matrix models on H. n , 
the space of Hermitian n x n matrices with the inner product (A, B) = Tr(AB*) and the probability measure 
given by 

F n (dM) = -\— e- nTl(v ^dM 
Z n {V) 

where here dM is the standard Lebesgue measure on 7i„. We have that for any bounded continuous function 
F : R -> R, 

1 Tr(F(M))¥ n (dM) ► [ F(x)p v (dx). (7.2) 



n 

Assume in addition that V(x) — px 2 is a convex function on R. Then, consider <I>(Af) = Tr</>(Af), where 
cj) : R — > R is a compactly supported and smooth function. Notice that V<I>(M) = (j)'{M) and thus |V<I>(M)| 2 = 
|0'(M)| 2 = Tr(</>'(M) 2 ). Since nTr(V"(Af)) — np\M\ 2 is convex, we can apply Poincare's inequality on Ti n to 
obtain that 

r Tr((j}'(M) 2 )¥ n (dM) > npVax Pn (Tr(<f>(M))). (7.3) 

The first term in this inequality (cf. equation (7.2)) converges to J 4>' (x) 2 py (dx) . To understand the second 

term in the above equation, notice that Var(Tr(0(M))) = E (Tr(0(M)) - E[Tr(0(M))]) 2 . The study of the 

asymptotic of the linear statistics, Tr(0(M)) — E[Tr(0(M))] in the literature of random matrix is known as 
"fluctuations". From Johansson's paper [19], it is known that this is universal in the sense that the limit in 
distribution of the fluctuations is Gaussian and, at least in the case of polynomial V (for which V(x) — px 2 
fulfills the conditions in there), the variance of the Gaussian limit depends only on the endpoints of the support 
of p,y. Moreover, in the particular case of V(x) = 2x 2 , the variance of the distribution was computed for example 
in [22] and [19] as 

1 f 1 f 1 f cj){t) - <p{s)\ 2 l-ts 



, . . , , dtds. (7.4) 

2n 2 j_ 1 j_ 1 \ t-s j yr~?vr^2 

This variance is interpreted in [8] in terms of the number operator of the arcsine law. We will come back to 
this aspect in Section 9. 

Dividing the inequality in equation (7.3) by n and taking the limit when n — > oo, these heuristics (after a 
simple rescaling) suggest the following result. 

Theorem 8. Assume that V(x) — px 2 is convex for some p > 0. Then for any smooth function (j), one has that 
\2 . ^ 9 f b f b f H^) ~ <f>(y)\ 2 -2ab+(a + b)(x + y) - 2xy 



<t>' (x) 2 pyidx) > / / — v :y yj — — dxdy. (7.5) 

K> 1 '-^ 2 JaJa\ *~V J 2^{x-a){b~x)^{y-a){b-y) V ' 

where supp(/iy) = [a, b]. Equality is attained for V(x) — p(x — a) 2 + (3 and 4>(x) = Ci + C2X for some constants 
ci , c 2 . 

The reader may wonder if the numerator in the second fraction of (7.5) is nonnegative. This is so because 

b — a\ I a + b\ ( a + b 



-2ab+{a + b)(x + y)-2xy = 2 U — j - \x-—j - — J j >0 
for any x, y G [a, b]. 

Proof. Using a simple rescaling we may assume without loss of generality that a = — 1 and 6=1 and the 
inequality we have to show reduces to 



2 

2tt 2 J_ x 7-i V x ~y ) VT^Vi- - ' 



M*)*M*°)>Af I (^M) dxdy. (7.6) 



Then, based on equation (2.5), we have that 



yr^ r 1 v{y)-v'{x) 
iy x ) = o 2 / n — w, — \ d y- 
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From the convexity of V(x) — px 2 , we learn that — y - x — an< ^ thus * na ^ 



^2 ,„ i„o,^ fW v'(y)-v'(x) 
y- 

which implies 



g{x) > Wi-* 2 , (7.7) 

7T 



<j)'{x) 2 pv{dx)>?- 4>'{x) 2 ^/l-x 2 dx. 



l-xy 



Therefore it is enough to check that 

' x ' x) - c/)(y) \ 2 

x-y ) y/l-x 2 ^l-y 2 

for any smooth <fi. Now, we make the change of variables x = cos t to justify 



.! 2irJ_ 1 J_ 1 \ x-y J 



dxdy (7. 



(j)'(x) 2 ^l- x 2 dx= I <p' {cost) 2 sin 2 {t)dt = \ ip'(t) 2 dt 



where ip(t) — cf>(cost). 

On the other hand, using the change of variable x = cost, y — cos s on the right hand side, inequality (7. 
becomes 



4>'(t) 2 dt >~ f f ( ^ ) (1 - cost cos s)dtds. (7.9) 



2 

2tt Jo Jo V cos * — cos s 

To show this, we write ij)(t) — X^fcLo Qfe cos ^ an( ^ then, because ip is a smooth function, we can differentiate 



term by term to get ip'(t) = — J2T=i ka k sinkt, therefore 

J ° k=i 



2„2 



and 

2 



C C ( ^(t) — ib(s) \ ,„ , , , v— * f n f* ( cos kt — cos ks) (cos It — cos Is) (1 — cos t cos s) , , 

/ / — ^ Z±-L) (1-cos t cos s)dtds= > a fe a ; / / > '-dtds. 

Jo Jo \cost~cossJ J J Q (cos t- cos s) 2 

To compute the integrals on the right hand side of the above equation, we take the generating function of these 
numbers and with a little algebra one can show that 

Ei- i C f 7 * (cos kt — cos ks) (cos It — cos ls)(l — cos t cos s) , , 
u v / / ~? — : vr 1 dtds 
' 3 Jo (cos t — cos s) 

r ( u - u3 )( v " f 3 )(l " cos t cos s) ^ ^ 71[) ^ 

o J Q (1 + u 2 — 2ucost)(l + u 2 — 2ucoss)(l + v 2 — 2vcost)(l + v 2 — 2vcoss) 



k,l = l 



tt 2 uv 



= TT 2 ^2ku k V k 



(1 - to) 2 

v ; k=l 

for all u,v E (—1, 1). The last integral can be computed as follows. First use partial fractions to justify 

(A + Bcost)dt _ r Cdt r Ddt _ C/2 D/2 



o (1 + u 2 — 2u cost) (1 + v 2 — 2v cost) J Q l + u 2 — 2ucost J 1 + v 2 — 2vcost 1 — u 2 1 — v 2 

where the constants C,D are linear combinations of A and B. Further, taking A = 1 and B = — coss and 
repeating once more the partial fractions argument, one can cary out the proof of (7.10). 
The main consequence of the above calculation is that 



(cos kt — cos ks)(cos It — cos is) (1 — cost coss) 



dtds = 7r kS 



and that 



o J (cost — coss) 2 

r r f m - <m \ (1 _ t cos s)dtds = kai. (7.11) 

J J \cost-cossJ 
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Therefore inequality (7.9) becomes equivalent to 



fe=i fc=i 

which is obviously true. Notice that equality in this inequality is attained for the case — for all k > 2 and 
arbitrary en.. This corresponds to the case ip(t) = c% + C2COst or <j>{x) — c^x + ci for some c\,C2- 

Finally we point out that equality in (7.6) is attained if the equality is attained in (7.7) and (7.9). From there 
one can easily see from rescaling that equality in (7.5) is attained for V(x) — p(x — a) 2 + [3 and <f>(x) = c\ + C2X. 
The proof of Theorem 8 is complete. □ 

In the above proof we showed a direct calculation for equation (7.11) which is natural in the course of the 
above proof. However, there is another way of looking at it which will appear below in Section 9 as the kernel 
of the number operator. 



8 A Second Version of Poincare Inequality 

The second version of the Poincare inequality is motivated by the free calculus and the noncommutative deriva- 
tive. It was already investigated by Biane [3] for the case of the semicircular law. 

Definition 1. For a given probability measure fi on R, we say that it satisfies a Poincare inequality if there is 
a constant C > such that 

(^^T~) H(te)li(dy) > CVar M (0) for every <j> e Cg(R). (8.1) 

By the best constant we mean the largest C > for which the above inequality is satisfied and we denote it by 
Poin(/x) or Xi(ii) or SG(fi). 

In the noncommutative setting for a given function <f>, we can think of D<j)(x, y) = as the noncom- 

mutative derivative of 4>. As pointed out by Voiculescu in [30], this is the unique map D : C (x) — > C (x) <g> C(x) 
such that 

1. Dl = 

2. D(fg) = D(f)g+fD(g) for any f,ge C(x). 

First we collect a couple of obvious properties of the Poincare constant. 
Proposition 2. 1. For any a^O, 

Poin((ax + 6)#m) = ~^ Poin(it) 

where here and elsewhere, for a given function f : K — > R, /#A* * s push forward measure given by 
(f # »)(A)=»(f-\A)). 

2. If f : R — > M is a differential map such that \f'(x)\ > c > for all then 

Poin(^) > c 2 Poin(/ # /x). 

3. If {/J, n }n>i is a sequence of probability measures which converges weakly to n, then 

Poin(/i) > limsupPoin(/i„). 

n — *oo 

Next we describe some bounds for the Poincare constant. 

Theorem 9. Assume that the measure \x has compact support and is not concentrated at one point. Then [i 
satisfies a Poincare inequality with 

d4- Poin(M) -v4) (8 - 2) 

where d(/j,) — diam(supp(/x)) is the diameter of the support of ft, andVai(n) = J x 2 fi(dx) — ( J xfi(dx)^ 2 . Equality 
on the left in (8.2) is attained only for the case 

/i = aS a + (1 — a)Sb, a < b, < a < 1. 
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Equality on the right of (8.2) is attained only for the case of a semicircular law (a £ K, r > 0) 

Kdx) = ^"^l[a-2r.a+2r](a;)V 4r2 ~ ( X ~ a) 2 dx. 

In addition, assume that V is a C 2 potential on K such that for some integer p and real p > 0, V{x) — px 2p , is 
convex and \i is the minimizer of 



V(x)fi(dx) II log \x - y\ (j,(dx)fjt(dy) 
over all probability measures o/R. Then 

< Poin(^). (8.3) 

In particular if p = 1, we get that £ < Poin(/i). 

Proof. For a given function <f> £ Cq(M), the left hand side of (8.2) follows from 

Var„(« = \jj WW - 0( S )) 3 M(<fc)f ») = \jj{x-yf ( ^I^ ) 

<mjj(m^\ (dx)m) . (8 . 4) 

The right hand side of (8.2) follows from (8.1) for a S Cq(M) such that = a; on the support of /i. 
For measures /i = aS a + (1 — a)8b, condition (8.1) is equivalent to 

Ca(l - a)(<t>{b) - 0(a)) 2 < a 2 {cp'{a)) 2 + (1 - a) 2 (</>'(6)) 2 + 2a(l - a) for any e Cq(M). 

Since for any function <E Cq°(]R) we can find another function ip £ Cq(K) so that cf>(a) = i/j(a) and 0(6) = i/j(b) 
and "0( a ) — 0; V'(^) — 0j this is also equivalent to 

Ca(l - a){^{b) - V>(o)) 2 < 2a(l - a) ( ^I^ ) for my ^ g 

This amounts to C < 2/(6 — a) 2 and therefore, in this case, Poin(/i) = d2 2 ^ . 

Conversely, if fj, is a measure so that Poin(/i) = d2 2 s , then, for 1 > e > 0, there is a function (j) e £ Cq(R) 
such that 

2 , 2 A ir „. /j. \ [ [ { <l>e( x ) - Mv)^ ^ 



+ e 2 Var^) > / / yev ; " €va/ »(dx)ti(dy). 



d 2 {v) J J J \ x - y 

Without loss of generality we can assume that = infsupp(/i), 1 = supsupp(/x) and J <j> e dfi — 0, J 4> 2 diJ, = 1 
where we recall that supp(/i) stands for the support of \x. In this case, the above inequality implies 

2 + e 2 > // ( m^tM) 2 , {dxMdy) + ff ( M*)~MV) \ 2 Kdx)fl{dy) 

JJ\x-y\>l-e\ X—y J J J\x-y\<l~£ \ X ~ V / 

> ff (<i>e( x ) - Mv)) 2 K dx )Kdy) + n , 2 // (<Ae(z) - My)) 2 Kdx)/j,(dy) 

JJ\x-y\>l-e \ l ~ e ) JJ\x-y\<l-e 

= -71 — / / (^( x ) _ My)) 2 Kdx)v(dy) + T , — ^ , 

which results with 

{Ux) - Uy)Y ''»{dx)ti(dy) > 2 - €{ \~ e)2 . (8.5) 

\x-y\>l-e z — e 
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Now, 



.1.1 



(<f> £ (x) - (j> € {y)Y ii{dx)n{dy) < 1 1 (0 e (» - Mv)) 2 ' n{dx)n(dy) 



\x-v\>\-e ' j j\x-l/2\>l/2-e . , 

[x vl - e |j/-1/2|>1/2-£ (p.O) 

<2/i(|x-l/2| > 1/2 -e). 

Thus (8.5) and (8.6) give 

fi(\x — 1/2| > 1/2 — e) > 1 — £ ^ ~ ^ for any 1 > e > 0. 

This shows that /i((0, 1)) = and therefore \i = aiJo + (1 — a)8\. 

The other extreme case of inequality (8.2) is contained in Biane's paper [3] in the more general context of 
several noncommutative variables. For completeness we will provide here a selfcontained proof. In the first place, 
using Proposition 8.1, we may assume that 

fi(dx) = — 1 [_ 2 ,2] (x) \A - x 2 dx 

is the semicircular law on [—2,2]. Take U n to be the Chebyshev polynomials of second kind defined by 
f7„(cos((9)) = sln ^" l g 1 - )e . With this choice, we have that J7„(f) are the orthogonal polynomials with respect 
to /i. The generating function of U n is given by 



oo 1 

J2r n U n (x)= for |x|, |r| < 1 

' 1 — 2rx + r z 



from which one gets 



and then 

U n (x) - U n (y) 



x — y 

y fc=0 



2^2u k (x)U n _ 1 _ k (y). (8.7) 



Now, for a given <f> £ Cq(M), we can write in L 2 (fi) sense, 

oo 
n=0 

yielding from orthogonality and (8.7) that 



Var^) = J 'tfdp-^j '<j>d»j = f>* and J J ( ^_^ ) = f>^. 

It follows that in this case Poin(/i) = 1 = l/Var(/i) and equality is attained only for 4>{x) = Ci+oJJiix) — Ci+c^x 
for some constants c\ , ci . 

To prove the converse, take a compactly supported measure fi and assume that J xfi(dx) — and J x 2 fi(dx) — 
1. In order to show that is the semicircular distribution, it suffices to show that J £/„ (§) /.i(dx) = for all 
n > 1. We use induction to this task. Assuming true for U±, U2, ■ ■ ■ , U n , and using U n +i(x) — 2xU n (x) — U n -i(x), 
we need to show that xU n (§) integrates to against \i. Applying Poincare's inequality to U n (|) + rU\ (|) 
together with the induction hypothesis and equation (8.7), we get that for any r £ R, 

I ui (|) m(^) + r J xu n (|) < J J ^ Mi)-MfA M(d£cMdy)! 

which implies that J xJ7„ (|) //(dx) = 0. 
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In the case of the equilibrium measure of a convex potential V, we have the support of the measure consists 
of one interval [a, b) and a, b solve the system (cf. equation (2.4)) 



If we denote c = (b — a)/2 and j3 = (a+ b)/2, the system above can be rewritten in terms of f3 and c as 



y'(/3 + ci)— =dt = l and — / V '((3 + ct)—=dt = -l 
tit J-i vl-t 2 2-K J_x VI - t 



which is equivalent to 

^ J V'(P + ct)-^J—dt = l and J V'{P + ct)-^=dt = Q. 
Since V is C 2 the first equation can be integrated by parts to get that 

— / V"(0 + ct)y/l-t 2 dt =l. 
On the other hand we know that V"(x) > 2p(2p - l)px 2p - 2 , hence 

1 ^ 2p(2p-l) P c? f\ cb + p?P -,^-fidt 

= P (2p-i) P c 2 p( 2 ;) 

4P(2p - 1) 

= ppQc 2p 

4p 

This yields 

c < 2 mp 

V vp, 

Finally, because d(p) = b — a = 2c, we arrive at (8.3). □ 

To conclude this section, we present an inequality which relates the equilibrium measure of a strong convex 
potential and the arcsine law. 

Theorem 10. Assume that V(x) — px 2 is a convex for some p > and the equilibrium measure py has support 

) . 1 

7rW (b—x)(x — a) 



[a,b]. Let arcsine Q ,f, = l[ ab ](x) — /n = =dx be the arcsine law with support [a,b]. Then for any smooth 
function supported on [a,b], 

(j)'(x) 2 p v (dx) > yoVar arcsin0a b ((j>), 



where the variance is considered with respect to the arcsine a {, law. 

Proof. It suffices to deal with the case a = — 1, b = 1, the rest following by simple rescaling. Recall that in the 
proof of Theorem 8, we use convexity to get that the density g(x) of py satisfies g[x) > £ vl — x 2 . Thus the 
proof reduces to 

1 f 1 

- 4>'(x) 2 y/l - x 2 {dx) > Var arcsino (0). (8.9) 
n J-i 

For this, write <fi — S^Lo a nT n (x) the expansion of <f> in terms of Chebyshev polynomials of the first kind. Now, 
T' n = nUn-i and thus the above inequality reduces to the obvious inequality J2^Li n2a n ^ S^Li a n- O 

We will actually see below that inequality (8.9) is simply the spectral gap for the Jacobi operator associated 
to the arcsine law. 
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9 Poincare Inequalities and Jacobi Operators 



In this section we show how the two versions of the Poincare inequalities can be viewed as spectral gaps for some 
Jacobi operators. This discussion is mainly driven from the work [ ] by Cabanal-Duvillard and his interpretation 
of the variance in (7.4) in terms of the number operator of the Jacobi operator associated to the arcsine law. 
This viewpoint allows for an unified perspective of the Poincare inequalities presented in the preceding sections. 
For our purpose we consider here the Jacobi operators given, for smooth functions on (— 1, 1), by 

L x f(x) = -(1 - x 2 )f" (x) + (2A + l)xf'(x) (9.1) 

for A > 0. We consider the Gegenbauer polynomials C„, A > 0, defined by the generating function 

1 



£r"G*(x) 



2\\ ' 



(1 — rx + r 2 ) 

For A = we set C*(x) = T n (x)/n, n > 1, where T n are the Chebyshev polynomials of the first kind. 
It is known that G A are eigenfunctions of L\, with eigenvalue n(n + 2A), i.e. 

L x Cl = n(n + 2X)C* 

On the other hand the Gegenbauer polynomials are orthogonal with respect to the probability measure 

2 2A r 2 (A+ 1) 



v\ = 



vrr(2A 



^l Mil] W(l-, 2 )^, 



Notice that in the case of A = 0, this becomes the arcsine law and for A = 1, this is the semicircular law, while 
for A = 1/2, this becomes the uniform measure on [—1,1]. 

Take now the normalized Gegenbauer polynomials (f>* — G*/ A/cJ, where c\ = j G^(x) 2 v\(dx). Then </> A 
form an orthonormal basis of L 2 (y\) and thus the operator L\ is diagonalized in this basis. Consider N\ to be 
the counting number operator with respect to the basis (j>^, i.e. 



"TO' 



NxK = Mfc. (9.2) 
This implies that L\ = N 2 + 2AN\. Therefore we have the following two inequalities 

L\ > (2A + l)N x and N x > 1 - P a (9.3) 

where P\ here stands for the projection on constant functions in L 2 (y\). In other words, P\(j> — J <f>v\. 

Notice that equation (9.3) include two statements. The first one is the comparison of L and N, with the 
spectral gap 2A + 1 while the second one is the spectral gap of the counting number operator with the spectral 
gap 1. In the sequel we want to translate these spectral gaps in terms of Poincare type inequality. For this 
matter we need to find the kernel of the operator N. 

Then we have for any function in the domain of definition of L\, that <p = X)^o a n < t > ni ano - then 

oo 

(L(f>, 0)l^ a ) = Yl n ( n + 2A )°V 
On the other hand, using integration by parts, we can justify that 

(Lcf>,<j)) L 2 {ux) = J (t>L x <f>dv x = J (t)'{x) 2 {l - x 2 )v x {dx). 
For the number operator, we have that 

p oo oo 

/ (j)N\<j)di>\ — \ na^ = lim \ nr n ~ a n . 

•J n 1 — n 



n=0 n=0 



Now, for — 1 < r < 1, 



oo „ „ oo 

Ynr^al = // <j>{x)<j>{y) ^ nr^^x^y^dx^dy). 



n=0 n=0 
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Furthermore, since J 4>\dv x = for n > 1, we also obtain that JJ 4> 2 (x)(t)^ l (y)v x (dx)v x (dy) = for n > and 
thus, denoting K x (r,x,y) = - Y^L nr 71 - 1 (j)^{x)4^{y) , 

0(a ; )0(y)^nr"- 1 ^(^)^(y)^(^)^A(dy) = \ If {<t>{x) - <f)(y)) 2 K x (r, x, y)v x {dx)v x {dy). 



The following formula is essentially due to Watson [33] and valid for A > 0, 

(l-r 2 )r(2A) f 1 (l-z 2 )^ 1 



5>"^(^n(v) = 



n=0 



2 2A-i r 2( A ) J_ x ^ _ 2r ^ y + Zy /(l_ x 2)(l_ y 2fi + r 2)l + A 



For A = 0, we have to deal with the Chebyshev polynomials of the first kind which was more or less what 
appeared in the proof of Theorem 8. For this case, we have that (denoting x = cost and y = coss), 



V" T ( \T ( \ = l-rcos(t + s) i - rcu^t. - s) 

2^ n(x) n\y) i_2rcos(t + s) + r 2 l-2rcos(t-s) + r 2 

where c n = J T 2 dv$ = 1 for n = and 1/2 otherwise. 

Thus, we obtain, after differentiation with respect to r and then limit over r | 1, that 



1 — rcos(t — s) 



r(2A) 



(1-z 2 ) 



2\A-1 



K x (x,y) = lim K x (r,x,y) = < 1 - xy 

(x .</)-•• 



2 3A-i r2(A) y ( y+A 

M 1 - a;y - zy/(l - x J ){l - y*)) 



{2(x- y y 



dz, A > 

A = 
A= 1. 



(9.4) 



The integrand is not a rational function. In some cases, it is algebraic since A > need not be an integer. 
To reveal the singularity of this kernel, we make the change of variable 

1-xy- Zx /(l-x 2 )(l-y 2 ) = t(l -xy- y/(l - x 2 )(l - y 2 )) . 

Then, after simple algebraic manipulations, setting f x : (0, 1) — » R, 

-V« [(t-l)(l- u i)] A - :i 



t A+l 



and 



H\(x,y) 



T(2A) (l - xy + V(l-^ 2 )(1-Z/ 2 ))' 

2 3A-i r 2( A ) ( (1 _ x 2) (1 _ y2))A-l/a 

1 - xy, 
1 

12 ' 



(x- y) 2 

(l-xy+ ^/(l-x 2 )(l-y 2 )y J 



A > 

A = 0, 
A= 1, 



(9.5) 



we can rewrite equation (9.4) for |a;|, \y\ < 1 as 



K\(x,y) 



H x {x,y) 



(x -y) 2 

where H x (x, y) is a continuous function of x, y £ [—1,1]. 
Now, from (9.3), we obtain the following result. 

Theorem 11. For any A > 0, one has for all A > and any <f> £ C 1 ([— 1, 1]), that 

2 



(9.6) 



<j)\x) 2 {l-x 2 )v x (dx) > 



2A+ 1 



x-y 



H\{x,y)v\{dx)v\{dy). 



(x) - (j>(y) 
x-y 



H x {x,y)v x (dx)v x (dy) > 2 Vax„ x (</)). 



(9.7) 
(9.8) 
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Remark 5. 1. Equation (9.7) for A = is the statement of Theorem 8 for the case V(x) = 2x 2 and for A = 1 
(more precisely, equation (7.8)) while equation (9.8) is the statement of the second Poincare inequality 
contained in Theorem 9 for the semicircular law. The combination of these two inequalities is equation 
(8.9). 

In other words, for measures v\, the first Poincare type inequality is driven by the comparison of the Jacobi 
and counting number operators defined in (9.1) and (9.2), as the second Poincare type is the spectral gap 
of the counting number operator. 

2. Combining equations (9.7) and (9.8), we also get a Brascamp-Lieb type inequality: 



(j)'{xY{l-x z )v x {dx) > (2A+l)Var„ A (<£). (9.9) 

For A > 1/2, the measure V\ is of the form e~ v ^ x >dx, where V(x) = —c\ — (A — 1/2) log(l — x 2 ), a strictly 
convex function on (—1,1) and according to the classical Brascamp-Lieb inequality [(>], 

V(x) 2 { }~ x2 ]\ x {dx) > (2A-l)Var I/A (^). (9.10) 



(l + x 2 ) 

Notice here that neither (9.9) not (9.10) implies the other which means that they complement each other 
in some sense. For example if <f> has support in f^], (9.9) implies (9.10), while if <fr is supported on 

[-MM-^,^], (9.10) implies (9.9). 

10 Wishart Ensembles and Marcenko-Pastur Distributions 

In this section, we address the preceding functional inequalities for probability measures on the real positive 
axis in the context of the Wishart Ensembles from random matrix theory and their associated Marcenko-Pastur 
distributions. 

We start with the random matrix heuristics although, as far as we know, it has not been used towards 
functional inequalities as before. The problems of large deviations principle for the distribution of the eigenvalues 
of Wishart ensembles is discussed in [16]. The model is as follows. Take T(n) a n x p(n) random matrix with 
all the entries being iid N(0, 1) random variables. Then T(n)T(n)* for n < p(n) is known as the nonsingular 
Wishart random ensemble. According to [17, page 129], the distribution of the Wishart ensembles is given by 

C np e-^ TrM (det M)^- n -^' 2 dM. 

where the measure dM = Yli<j dMij the restriction of the Lebegue measure on the set of n x n non-negative 
matrices. 

It is also known (for example [17, page 129]) that the joint distribution of eigenvalues (Ai, A2, . . . , A„) of 
^T{n)T{nf is given by 



e 



Our interest is in the limit distribution of \i n = A Y^—i ^A 4 - The classical result states that if n/p(n) ► a £ 

(0,1], then the limit distribution of fi n is the so called Marcenko-Pastur distribution given by 



yjAa — (x — 1 — a) 2 
1 [(i-^) 2 ,(i+x/^) 2 ] W ^kc7x 

This is a particular model for the standard Wishart ensembles. However one can consider a more general example 
with potentials for which the distribution of the matrix is driven by a potential Q : [0, 00) — * R, 

C n e-'W^^det M)~> (n) dM 

where dM stands for the Lebesgue measure on n x n positive definite matrices. The distribution of eigenvalues 
of M is given by 

n 

J_ e -p(»)£?^);Q,7(n) -Q % _ t . V 
i—1 \<i<.j<n 
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The main result of [1 6] is that the distribution of the random measures p n — Yli=i ^ under the conditions 
n/p(n) ► a G (0, 1], j(n)/n ► 7 > 0, v n satisfy a large deviation principle with scale n~ 2 and the rate 

n — >oo n — >oo 

function given by 



where 



Rfa) = E q {jjl) - inf E q {jjl), 
Eq{h) = / ot(Q{x) - jlog(x))p(dx) - y //log I a; - y\p(dx)p(dy). 



This gives the following motivation. Assume that V : [0, 00) — > MU{+oo} is a lower semi-continuous potential 
such that liiri| a ;|_ >00 (V(a;) — 2 log |x|) = 00. Then, according to the results in [27], we know that there is a unique 
minimizer of 

inf Ey(u). 

Ate-P([0,oo)) 

In addition the equilibrium measure py na s compact support. 

A particular case of interest is V(x) — rx — slog(a;) with r > 0, s > for which we know [27, page 207] that 
the equilibrium measure is given by 

^ ry/(x-a)(b-x) ^ s + 2- 2^1+1 s + 2 + 2^/I + T 

pv(dx) = lr a feifx) ax where a= . 0= . (10.1) 

2nx r r 

One recovers the Marcenko-Pastur distribution for V{x) — rx — slog(a;), r > 0, s > 0, with r — 1/a and 
s = (1 — a) /a. 

The natural way to deal with functional inequalities in the context of measures on the positive axis [0, co) is 
to transfer measures from [0, co) into measures on the whole K. For a measure fi on [0,oo), consider thus the 
associated symmetric measure jl on R defined as 

H(F) = fi{{x : x 2 e F}) (10.2) 

for any measurable set F of [0, 00). Defining V(x) = V(x 2 )/2, it is then an easy exercise to check that 

E v (n) = 2Ey(fl). (10.3) 

In addition, the minimizer of Ey is /i v = jly Further, for the non-decreasing transportation map 9 of into 
/k, define 

6{x) =sign(x)v/(9(a; 2 ), (10.4) 

which transports jly into jl. 

In addition, as it was pointed out in [ ], the relative free Fisher information Iv(fJ-) is defined for measures 
/i on [0, 00) with density p = d/i/dx in £ 3 ([0, 00), xdx) as 

f°° f 2 

I V (H)= x(Hfjb(x) - V (x)) 2 fi(dx) with Hn(x)=p.v. n(dy). (10.5) 

Jo J x — y 

Otherwise we take /y(/i) = +00. The main reason for defining this in this way is because, cf. [18, Lemma 6.3] 
and the discussion following, one has 

I V {H) = 2I V {P), (10.6) 

where I y is defined by (4.1). 

To state the transportation cost result, we define the appropriate distance. For any /x, v G V([0,oo)), set the 
distance as 

W(ji, v)= inf ( f (y/E - Vy) 2 *(dx, dy)) (10.7) 

where II(/i, v) is the set of probability measures on M 2 with marginals /1 and v. 
In this context we have the following transportation cost inequality. 

Theorem 12. Assume that V : (0,oo) — + K is C 2 ((0, 00)) swc/i i/ia< I^(x 2 ) — px 2 is convex on (0, co) for some 
p > and let py be the equilibrium measure of V on [0,oo). Then, for any probability measure p on [0, 00), we 
have that 

pW 2 (p,p v )<E v (p)-E v (p v ) 1 (10.8) 
In the case ofV(x) = rx — slog(x) with r > and s > 0, this inequality with p — r is sharp. 
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Proof. As announced, the idea is to interpret this inequality as an inequality for potentials on the whole real 
line instead of [0, oo). Using the measures fi and \ly from equation (10.2) together with (10.3), we have that 

Evin) - Ey^v) = 2(Ey{ix) - E y (fi v )). 

On the other hand, if 9 is the (increasing) transportation map of fiy into /i, then it is not hard to check that 

W 2 {^v) = J (y/x- y/9(^j) 2 (l V (dx) = J {x-9{x)ffl V {dx). 

In this framework the inequality (10.8) translates as 

From here we will use the same argument as in the proof of Theorem 2. Start with 
E v (fi) - E v (fl v ) = J (y(6(x))-V(x)-V'(x)(6(x)-x))fl v (dx) 

JJ \ x-y x-y J 

and notice that the second line of this is non-negative. For the first line we point out that because V(x) — ^x 2 
is convex and x and 0{x) have the same sign, for any x, 



V{9{x)) - V(x) - V'{x)(6{x) -x)> ?-{6{x) - x) 2 
which implies (10.8). 

In the case V(x) = rx — slog(x), take 9{x) — {\fx + to) 2 for large m and notice that 9{x) — x + msign(x). 
Therefore inequality (10.9) becomes 

rm 2 < rm 2 +2r m j \x\H(dx)-2s J log ^ + K dx)- JJ log f 1 + m ^tl^M \ ~ K dx)fi(d y ) 

which is sharp for large m. □ 

The next result is the Log-Sobolev type inequality, which was conjectured by Cabanal-Duvillard in [7 , page 
140] for the case of Marcenko-Pastur distribution. 

Theorem 13. Let V be as in the previous theorem. Then, with the definition from (10.5) and for any measure 

n eV([0,oo)), 

E v {y) - Evivv) < i Iv(fJ-)- (10.10) 
2p 

In the case V[x) = rx — slog(x), r > and s > inequality (10.10) with p — r is sharp. 

Proof. We will discuss here the proof only in the case when /j, has a smooth compactly supported density, careful 
approximations being described in [18]. 

From (10.6), we have = 2I v (fl), where I v {p) = J(Hfl(x) — V' (x)) 2 fi(dx) . Rewriting everything in 

terms of ft and the associated quantities, the inequality to be proven can be written in the same way as we did 
in the proof of Theorem 4, 

i J (Hp(9(x))~V'(9(x))) 2 il v (dx) + J ( K V{x)-V{9(x))-V'{9{x)){x-~9{x)))ia v {dx) 

'H0{x)) - V'0(x)j) (x - 9(x))fiy([dx) 



- / IJiHfH.r))U ■«(,-))//,■ Ul.v) - / / log * |— fi v {dx)fL v {dy) > 0. (10.11) 
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Notice that V(x) — %x 2 is not convex on the whole real line but it is convex on the intervals (0, oo) and (— oo,0). 
The key to everything here is that 9{x) has the same sign as x and this allows us to apply convexity of V(x) — ^x 2 
on each of the intervals (—00, 0) and (0, 00) to conclude that 

V(x)-V(6(x)) -V'(6(x))(x-6(x)) > P - (x 2 - 9{x) 2 - 29{x){x - 9{x))) = ^{x-9{x)f. (10.12) 

From here we can follow word by word the proof of Theorem 4. 

For the case V(x) = rx, we have equality in (10.10) if 9(x) = x + rnsign(x) and thus this means 9(x) = 
(y/x + m) 2 . 

In the case V(x) — rx — slog(a;), we look at 9(x) — x + m for large m. In this case V(x) = rx 2 /2 — slog \x\ 
and then a simple calculation shows that (10.10) is equivalent to 

2,o f 1 I ~ (j\ 9 fi f \x + msiga(x)\ \ _ ff ( sign(x) - sign(y) \ 
rm +2mr J \x\pv\dx) — 2s J log I r— I pv(dx) — 2 J J log II + m j fj,{dx)fi(dy) 

2 r / \ 2 

m / s x 



< — / [r — , . — fiv(dx). 

p J \ x{x + msign(x)) J 

Dividing both sides by m 2 and taking the limit of m to infinity implies that p < r. On the other hand p = r 
validates (10.10), hence p = r is the best constant. □ 

Next in line is the HWI inequality which is the content of the following statement. 

Theorem 14. Assume V is as in Theorem 12 and the distance W given by (10.7). Then for any measure 
MGT'QO.oo)), 

Ev(ji) ~ Eviw) < ^2I v (fi)W(p, fi v ) -pW 2 (p, fi V ). (10.13) 
For the case ofV(x) = rx — slog(a;), r > 0, s > 0, this inequality for p — r is sharp. 

Proof. As it was made clear in the previous two theorems, we translate this inequality in terms of the associated 
symmetric measures on R. Following upon the proofs of above theorems, we can rewrite (10.13) in the following 
form: 

(Hp(9(x)) -V' (9{x))) 2 fi v (dx) f(9(x)-x) 2 fl v (dx)) ' - / (h0(x)) - V'(0(x)j) (x - 9{x))p, v (dx) 

2 



V{x) - V(9(x)) - V'{9{x))(x - 9{x)) - p(9{x) - x) 2 j jl v {dx) 
+ j Hfi{9{x))(x - ~9{x))p v {dx) - J J log *~ y fi v (dx)ii v (dy) > 0. 

Using the fact that V(x) — ^x 2 is convex on each interval (—00, 0) and (0, 00) combined with the fact that x and 
9(x) have the same sign, the rest of the proof is the same as the one of Theorem 5. 

For the case V(x) = rx — slog(x), using 9{x) = (s/x + m) 2 , one can show that p = r is sharp. □ 

At last, we would like to discuss a Poincare type inequality in this context. As in Section 7, for the heuristics, 
we consider the general model of random matrices with distribution 

V n (dM) = C n e- nrTrM (detM) sn dM = C n e -^(rM-slo S (M)) dM = ^ e -nTr(V(M)) dM (1Q u) 

where dM stands for the Lebesgue measure on n x n positive definite matrices and s > 0. For a given smooth 
compactly supported function <f> : [0, 00) — ► K, we want to apply the Brascamp-Lieb inequality [ >] to the function 
$(M) = Tr</>(M) on the space of positive definite matrices. Now, V$(M) = 4>'{M). 

The Hessian of W(M) := Tr(V(Af)) can be interpreted as a linear map from TL n (nxn Hcrmitian matrices) into 
itself which is given by V 2 ^(M)X = sM^XM' 1 . Hence the inverse of the Hessian is then (V 2 *(M))" 1 X = 
- s MXM. Thus we obtain from Brascamp-Lieb that 

- Tr((V 2 ^(M))-V(M) 2 )P„(dM) > Varp„($(M)). 
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On the other hand, from [20] or [8] the variance of <!>(M) converges to |Var arcs i nC[a b] (<f>), where we recall that 
arcsine[ 0)fe ] = — , dx is the arcsine law on the support [a,b] of fi v . Next, iTr((V 2 ^(M))~ 1 <j)'(M) 2 ) = 

— Tr (((/)' (M)M) 2 ), whose integral against P„ converges to the integral of ^x 2 (j)'(x) 2 against the equilibrium 
measure fiy from equation (10.1). These considerations suggest that 



x 2 <f>'(x) 2 fi v {dx) > - Var arcsin0[a b] ((f>). (10.15) 



Notice here that one can actually make this heuristic into an actual proof of this inequality. 

Motivated by these heuristics and also inspired by Theorem 8, we have the following stronger result. 

Theorem 15. Assume that Q : [0, oo) — > K is a convex potential and let V(x) = Q(x) — slog(a;) for s > 
satisfy lim. I .^ 00 (y (x) — 2 log(ir)) = oo. Assume that the support of fj,\/ is [a, b] . Then for any smooth function 4> 
on [a,b], the following holds, 

«V(aO V(d*) > A f [ b ( M^tm 2 -2ab+(a + b) (x + y)-2xy 

'-^JaJaK x-y J 2y/(x-a)(b-x)y/(y-a)(b-y) 

If Q(x) = rx + t, equality is attained for 4>{x) = C\ + — , therefore (10.16) is sharp. 

In particular, combining (10.16) with (9.8) for A = 0, we get an improvement of (10.15) as 

x 2 cj)'(x) 2 nv(dx) > - Var arcsin0[a M (4>). 

Equality though is attained only for <fi identically 0. 

In the case V{x) — rx, r > 0, on [0, oo), there is no constant C > such that inequality (10.16) holds with 
C instead of s/4ir 2 . Nevertheless, for every smooth <f> on [a,b], the following holds, 



~ 4W a Ja \ x-y ) 2y/(x-a)(b-x)y/(y-a)(b-y) 

with equality for <j){x) — c\ + Cix. 

As remarked after the statement of Theorem 8, the numerator in (10.17) is nonnegative. 

Proof. The same argument as in the proof of Theorem 8, shows that the density g{x) of fiy satisfies 



Sy /(x-a)(b-x) 

9[x) > 



2nxy ab 

therefore it suffices to show that 

a 

Next, making the change of variable x — (a + b)/2 + u(b — a)/2 and denoting C,{u) = 4>{{a + b)/2 + u(b — a)/2), 
we reduce the problem to showing that for any smooth function on [—1,1], we have 



' f b x^x)W (x-a) { b-x) dx>^ [" l b (m^m\ 2 ; 2ab +{a + b)(x + y)-2xy 
^/abJa ~ 2ir 2 J a J a \ x-y J 2^{x-a){b-x)^{y-a){b-y) 



2 1 - Ml) 

dudv. 

v 1 — u 2 v 1 — V 2 



Denoting (3 = , we have that = ^ 2 , and the preceding inequality reformulates as 

(i + p U )C(u) 2 V^r 2 du>^f^ f f ( c( ^cm) 2 ^_^_^. (10 .i 8 ) 

2tt 7-i 7-i V u ~ v J vi - u 2 vi - v 2 



To show this, take ip(t) = £(cos(t)) and then after the change of variable u = cos(t) we need to check 

27T Jo Jo \cos(t) - cos(s) 



(1 + /3cos(t))^(t) 2 dt > ^\ 132 f P f-^ — (! - cos ( f ) cos(s))dtds. 
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Writing ip(t) = X^^Lo a ™ cos ( n t) an d using that tp'(t) = — X^Li na n sm(nt), together with the fact that 

.... . . . . , If for \m — n| = 1 

cos(i) sm(nt) sm{mt)dt = < 

I otherwise, 

and equation (7.11), the inequality becomes 

£)(n a a* + (3n(n+l)a n a n+l ) > ^l-/3^ na l ( 10 - 19 ) 
n>l n>l 

Let (5 = 1 ^ ^ - be the solution < (5 < 1 of /3<5 2 — 26 + j3 = 0. Notice that for any n > 1, we have 

^ 2 1 2 

a„a„ + i > — - a„ - — a n+1 

which implies that 

E (n 2 a 2 n +Pn(n+l)a n a n+1 ) > £ (n 2 a 2 n - + = £ n/3(1 ~ Q a 2 = v^^ M 2 „, 

n>l ri>l ^ \ ' / / „>! „> 1 

what we had to prove. Notice here that equality is attained in this inequality if and only if a n+ \ — —8a n for 
all n > 1, which means that a n = (— \) n ~ 1 8 n ~ 1 ai. This corresponds to the function ijj(t) — a± i+^+^lcost ' or 
~~ a i i+52+2i5ti which means that 4>(x) — a\(r — s/x). Therefore equality holds also for 4>(x) — c\ + cijx. 
For the second part, in the case V(x) — rx with r > 0, notice that if there is a C > so that (10.16) holds 
with C instead of s/4ir 2 , then, following the same argument as above, we would have the equivalent of (10.19) 
as 

E ( n2fl ™ + n ( n + 1 ) a « a «+i) - C ^2 na n- 

n>\ n>l 

Taking in this a n — ^~ 7 - ) for < 7 < 1, we have that 7 2 /(7 + 1) > — Clog(l — 7 2 ), and this is certainly false 
for 7 close to 1. 

For equation (10.17), notice that in this case the equilibrium measure is /iy(dx) — and then after a 

simple rescaling this follows from equation (7.8). This complete the proof of the theorem. □ 

It is interesting to look at this inequality as a spectral gap result as in Section 9. For example in the case 
of the Marcenko-Pastur measure (Q(x) — rx), the inequality (10.16) is actually equivalent to inequality (10.18). 
Using the interpretation from Section 9, we can rephrase this as, for a given (3 € (0, 1), 

(l + Px){l-x 2 )(t>'(x) 2 Vo (dx) > V1-/3 2 {N(j},4>) Vo 

where vq is the arcsine law on [—1, 1] and N is the number operator. Now we can define the operator 

L (j>(x) = -(1 + [3x){\ - x 2 )(j)"{x) -(0-x- 2fix 2 )$(x). 

With this definition, 

(Lff4>, <t>) Uo = ^J(1 + [3x)cj)'(x) 2 Vl-x 2 dx 
and then inequality (10.18) becomes 

(LpMU > Vl-/? 2 (N<f>,4>)v 

for any smooth function <p on [—1, 1]. In particular this means that Lp > yl — /3 2 N. On the other hand it is 
clear that the operator Lp can not be diagonalized by the Chebyshev polynomials of the first kind, therefore the 
orthogonal polynomial approach given in Section 9 does not work the same way here. 

Remark 6. We want to point out that for the case V(x) — rx — slog(x) for r > and s > 0, the parameter 
r appears in the transportation, Log-Sobolev and HWI, while the parameter s plays the dominant role in the 
Poincare inequality. 
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